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INTRODUCTION 


The  broad  aim  of  this  proposal  is  to  test  the  hypothesis  that  the  coordinate  action  of  proteases 
and  protease  inhibitors  are  responsible  for  growth  factor  activation  and  release  in  the  human  breast 
cancer  cell,  and  for  the  progression  of  breast  cancer  in  vivo.  One-third  of  all  cases  of  advanced  breast 
cancer  are  estrogen  responsive,  and  recent  epidemiological  studies  and  studies  using  human  breast 
cancer  cells  in  culture  strongly  suggest  a  correlation  between  estrogens  and  the  pathogenesis  of  breast 
cancer  (Harris  et  al  1992).  The  mechanisms  by  which  this  occurs  are  not  entirely  clear.  Certainly, 
estrogens  have  a  direct  effect  on  cell  growth.  In  addition,  they  can  stimulate  the  expression  and  release 
of  a  variety  of  polypeptide  growth  factors.  It  is  highly  likely  that  the  tummorigenic  effects  of  estrogens 
are  due,  at  least  in  part,  to  the  autocrine/paracrine  action  of  these  factors.  Several  of  these 
polypeptides,  including  epidermal  growth  factor  (EGF),  and  its  analogs  heregulin  and  transforming 
growth  factor-a  (TGFa)  and  the  insulin-like  growth  factors  (IGF-I  and  IGF-II),  have  been  shown  to 
require  pericellular  proteolysis  for  activation  or  release  (Massague  and  Pandiella  1993,  Hooper  et  al 
1997).  To  achieve  homeostasis  in  a  normal  breast  epithelial  cell,  levels  of  these  pericellular  growth 
factor  activating  proteases  also  must  be  regulated.  We  hypothesized  that  this  was  accomplished  by  the 
action  of  locally  synthesized  protease  inhibitors.  Thus  an  imbalance  in  the  ratio  between  local  levels  of 
particular  proteases  and  protease  inhibitors  could  be  responsible  for  increases  in  tumorigenic  potential. 

TGFa,  a  peptide  structurally  and  functionally  related  to  EGF,  interacts  with  the  EGF  receptor 
(EGFR)  and  elicits  a  mitogenic  response  in  a  variety  of  cells  (Lee  et  al  1995).  TGFa  expression  occurs 
in  normal  breast  tissue,  breast  tumors  and  breast  cancer  cells  in  culture  and  TGFa  has  been  proposed 
to  act  as  a  major  autocrine  mediator  of  estrogen-stimulated  growth  in  estrogen-dependent  breast 
cancer  cells  (Harris  et  al  1992).  Expression  of  the  TGFa/EGFR  pair  have  been  shown  to  be  associated 
with  proliferation  and  angiogenesis  in  invasive  breast  cancer  (De  Jong  et  al  1998).  TGFa  is 
synthesized  as  part  of  a  20-22  kDa  glycosylated,  type  I  membrane  protein  precursor  (proTGFa)  which 
can  be  processed  intracellularly  and  extracellularly  by  glycosylation  and  proteolysis  to  yield  a  family  of 
polypeptides  of  from  6  to  17  Kda  in  size.  (Massague  and  Pandiella  1993,  Baselga  et  al,  1996). 
Proteolytic  processing  occurs  in  two  steps,  the  first  results  in  cleavage  between  Ala39  and  Val40 
(Scheme  1).  The  second,  occurring  closer  to  the  cell  membrane,  results  in  release  of  the  6  KDa  mature 
TGFa  peptide.  While  it  is  generally  accepted  that  the  second  cleavage  occurs  between  Ala89  and  Val90, 
this  does  not  rule  out  cleavage  at  another  site  (e.g  between  Lys96  and  Lys97).  Complete  processing 
does  not  occur  to  the  same  extent  in  all  tissues  and  it  is  certainly  possible  that  shedding  of  TGFa  from 
the  cell  surface  may  be  accomplished  by  different  proteases  in  different  tissues  (Hooper  et  al  1997, 
Arribas  et  al  1996).  While  all  TGFa  forms  appear  to  possess  some  degree  of  biological  activity,  there  is 
good  evidence  that  particular  biological  actions  may  depend  on  the  degree  of  proteolytic  processing. 

i  I 

NH2-Ala-Ala-Ala-Val-Val - Leu-Leu-Ala-Val-Val-Ala-Ala-Ser-GIn-Lys-Lys-GIn-Aja— COOH 

39  40  89  90  96  97  99 

Scheme  1.  Cleavage  sites  in  ProTGFa  yielding  mature  50  aa  TGFa  peptide.  Arrows  indicate  the 

normal  cleavage  sites.  Ala99  begins  the  hydrophobic  membrane-spanning  region. 

Results  from  this  laboratory  have  clearly  demonstrated  a  relationship  between  tumorigenicity  of 
MCF-7  human  breast  cancer  cells  (as  measured  by  growth  in  soft  agar),  endogenous  synthesis  of  the 
protease  inhibitor  arantitrypsin  (arAT)  and  release  of  TGFa  (Tamir  et  al  1990,  Finlay  et  al  1993a, b). 
Growth  in  soft  agar  was  blocked  by  arAT  whether  added  to  the  tissue  culture  media  or  synthesized  by 
the  tumor  cell  itself.  A  useful  tool  in  these  studies  was  a  new  MCF-7  cell  subline,  producing  10-fold 
higher  levels  of  arAT  than  its  parental  cell  line,  constructed  by  stable  transfection  with  an  arAT  cDNA 
(Yavelow  et  al  1997).  Growth  in  soft  agar  and  release  of  TGFa  was  decreased  in  cells  transfected  with 
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the  a^AT  cDNA  when  compared  to  cells  transfected  with  vector  alone.  Consistent  with  the  above  we 
had  identified  a  serine  protease  with  elastase-like  activity,  capable  of  forming  a  stable  complex  with  a^ 
AT,  on  the  MCF-7  cell  surface. 

Our  specific  aims  as  stated  in  our  grant  application  were  to: 

1.  To  identify  and  clone  the  growth-modulating  pericellular  proteases  from  MCF-7  cells, 
particularly,  the  elastase-like  enzyme(s)  that  are  able  to  effect  the  release  of  TGFa  from  the 
tumor  cell  surface. 

2.  To  show  that  the  ability  of  MCF-7  sublines  to  form  colonies  in  soft  agar  and  tumors  in  nude 
mice  is  a  function  of  their  expression  of  a^AT  and  specific  pericellular  proteases. 

3.  To  extend  our  observations  relating  to  TGFa  release/activation  made  in  MCF-7  human  breast 
cancer  cells  in  culture  to  other  breast  cancer  cell  lines,  and  to  normal  and  malignant  human 
breast  tissue. 

4.  To  test  the  hypothesis  that  the  pericellular  proteaseiprotease  inhibitor  ratio  responsible  for 
TGFa  release  can  be  regulated  by  ST-3  and  other  potential  effectors  such  as  antiestrogens, 
phorbol  esters  and  SEC  receptor  agonists.  This  information  may  provide  insight  into 
mechanisms  by  which  protease  and  protease  inhibitor  levels  may  be  independently  controlled. 

To  accomplish  the  above  specific  aims,  we  proposed  to  carry  out  the  sequence  of  studies  described  in 
our  Statement  of  Work.  While  the  project  has  not  proceeded  in  exactly  the  order  as  originally 
anticipated,  I  believe  that  we  are  fairly  well  on  track.  Our  progress  is  outlined  below. 

YEAR  STATUS  OF  PROJECTED  STUDY 

1-2  Isolate,  characterize  and  clone  proteases  from  MCF-7  cells.  This  phase  of  the  work  has 
been  completed.  A  revised  manuscript  describing  the  cloning  and  properties  of  the  TGFa 
cleavage  protease  (which  we  now  call  human  epithin)  has  been  submitted  for  publication. 

1-3  Compare  production  of  protease  and  protease  inhibitors  by  MCF-7  cell  sublines  with  their 
ability  to  form  colonies  in  soft  agar  and  cause  tumor  formation  in  nude  mice.  Because  of 
difficulties  in  generating  the  MCF-7  cell  clones  hyper-producing  pro-TGFa,  PCR-7  protease, 
and  a1-antitrypsin-(Pittsburgh)  and  because  of  problems  with  the  identity  of  the  TGFa 
cleavage  enzyme,  this  project  is  a  little  behind  schedule.  Studies  of  colony  formation  in  soft 
agar  have  been  completed.  We  have  just  initiated  the  nude  mouse  experiments.  This  phase 
of  the  work  should  be  completed  within  60  days. 

1- 3  Extend  our  observations  relating  to  TGFa  release/activation  made  in  MCF-7  human  breast 

cancer  cells  in  culture  to  other  breast  cancer  cell  lines,  and  to  normal  and  malignant  human 
breast  tissue.  We  have  demonstrated  production  of  PCR-7  protease  in  normal  and 
malignant  breast  epithelial  tissue  and  anticipate  looking  at  TGFa  shedding  in  other  breast 
cancer  cell  lines  and  in  breast  epithelial  tissue  over  the  next  several  months.  Breast  cancer 
cell  studies  using  the  non-transformed  cell  line  MCF-10A,  the  ER-negative  cell  lines  BT-20 
and  T47D  and  a  second  ER-positive  cell  line,  ZR-65-1  are  being  completed  in  my 
laboratory. 

2- 4  Examine  production  and  localization  of  proteases,  protease  inhibitors,  growth  factor 

receptors  and  sites  of  growth  factor  activation  in  normal  and  malignant  breast  tissue.  In 
collaboration  with  Dr.  Helen  Feiner  (Department  of  Anatomic  Pathology)  we  have  begun  to 
look  at  histochemical  localization  of  PCR-7  protease,  EGF  receptor  and  ProTGFa  in  normal 
and  maligmnant  breast  tissue.  These  studies  are  continuing. 
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1-4  Test  the  hypothesis  that  the  pericellular  protease:protease  inhibitor  ratio  responsible  for 
TGFa  release  can  be  regulated  by  ST-3  and  other  potential  effectors  Identify  potential 
modulators  of  protease  inhibitor:  protease  ratio  and  growth  factor  activation/release  in 
breast  cancer  cells  and  in  ductal  epithelium  from  human  breast.  These  studies  have  been 
initiated 

RESULTS  AND  DISCUSSION 

1.  Cloning  of  Human  Epithin:  a  Potential  TGFa  Releasing  Serine  Protease. 

Degenerate  oligonucleotides  based  on  the  conserved  sequences  about  the  his57,  asp102  and 
ser195  residues  in  mammalian  serine  proteases  (chymotrypsin)  were  used  to  clone  a  potential  TGFa- 
releasing  serine  protease  from  an  MCF-7  breast  cancer  cell  cDNA  library.  In  the  initial  step,  a  series  of 
approximately  500  bp  fragments  between  the  his57  and  ser195  sites  were  amplified  by  PCR.  The 
amplified  sequences  were  then  cloned  into  a  PCR  cloning  vector,  which  was  used  to  construct  a  mini- 
cDNA  library.  Clones  from  the  mini  library  were  selected  by  Southern  blotting  using  a  32P-labeled 
degenerate  oligonucleotide  probe  based  on  the  sequence  about  asp102.  Several  positive  clones  were 
sequenced.  The  460  bp  sequence  of  one  strongly  hybridizing  clone,  designated  PCR-7,  showed  a  high 
degree  of  homology  to  known  serine  proteases  (64%  identity  in  a  220  bp  region  at  the  3’  terminus  to 
human  trypsinogen-B  and  a  60%  identity  in  a  120  bp  region  around  the  5’  terminus  to  human 
pancreatic  protease).  Northern  blot  analysis,  using  a  PCR-7  cDNA  probe,  showed  the  expression  of  an 
approximately  4.2  kb  mRNA  species  in  total  RNA  from  MCF-7  cells  (Fig.  1).  Similar  mRNA  species 
were  expressed  in  a  several  different  human  breast  cancer  cell  lines  and  in  normal  human  breast 
tissue,  in  trophoblast  from  term  human  placenta  and  in  proliferative  phase  uterine  glandular  epithelium. 

Figure  1.  Northern  blot  analysis 
using  the  500  b  PCR-7  cDNA 
probe.  Total  RNA,  prepared  from 
various  human  tissues  and  cell 
lines  as  indicated  below,  were 
electrophoresed  on  a  1 .5% 
agarose/  formaldehyde  gel, 
transferred  to  GeneScreen  nylon 
membrane.  Membranes  were 
probed  with  a  32P-labelled  PCR-7 
cDNA  insert  and  then  reprobed  with  a  32P-labelled  actin  cDNA.  Lane  1,  Terminal  duct  lobular  Unit  from 
human  breast  ductal  epithelium;  lane  2;  human  ductal  glandular  epithelial  cells;  lane  3,  trophoblast  cells 
from  term  human  placenta;  lanes  4-7,  various  human  breast  cancer  cell  line;  lanes  8-9,  MCF-7  breast 
cancer  cells  and  HepG2  human  liver  cancer  cell  lines 

Two  cDNAs  of  approximately  3.8  and  3.2  kb  containing  the  PCR-7  sequence  were  cloned  from 
a  second  MCF-7  cell  cDNA  library  using  the  PCR-7  sequence  as  a  probe.  Both  clones  hybridized  to  the 
same  sized  mRNA  as  did  the  PCR-7  sequence  used  to  screen  the  library.  Both  cDNAs  have  been 
sequenced  and  except  for  an  additional  600  nucleotides  in  the  5'  terminus  of  the  larger  clone  and  some 
minor  differences  in  the  3’  sequence,  the  two  sequences  appear  to  be  identical  where  they  overlap. 
Because  of  inconsistencies  between  the  MW  of  the  TGFa  releasing  protease  calculated  from  the 
deduced  amino  acid  sequence  and  the  apparent  MW  observed  on  western  blotting,  the  larger,  3.8  kb, 
cDNA  has  been  resequenced  for  a  third  time  and  we  believe  all  ambiguities  have  finally  been  resolved 
(Fig.  2).  The  sequence  has  a  2565  b  open  reading  frame  coding  for  an  855  amino  acid  protein  of 
approximately  95  kDa  and  a  542  b  5’  sequence  following  the  stop  codon  at  position  2721 .  A  Kozack 
consensus  sequence  for  initiation  of  eukaryotic  translation  is  present  at  the  putative  methionine 
initiation  codon  at  position  156  (Kozack  et  al  1990). 
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Figure  2.  Complete  nucleotide  sequence  of  human  epithin/matriptase  cDNA  and  its  deduced 
amino  acid  sequence.  The  potential  membrane-spanning  region  is  doubly  underlined.  The  two 
putative  CUB  domains  have  black  letters  on  a  gray  background  while  the  four  putative  LDRa  domains 
have  white  letters  on  a  black  background.  The  poteintial  protease  activation  site  after  Arg614  is  marked 
by  an  arrow.  The  residues  at  the  putative  catalytic  site  (His656,  Asp711  and  Ser805)  are  underlined  and  in 
boldface. 

1  gagcgcgccagggcgagggcaccgccgccggtcgggcgcgctgggcctgcccggaatcccgccgcctgcgccccgcgccccgcgccctgcgggccatgggagccggccgccggcagggac  120 

121  gacgcctgtgagacccgcgagcggcctcggggaccatgggtagcgatcgggcccgcaagggcggagggggcccgaaggacttcggcgcgggactcaagtacaactcccggcacgagaaag  240 
1  MGSDRARKGGGGPKDFGAGLKYNSRHEKV  29 

241  tgaatggcttggaggaaggcgtggagttcctgccagtcaacaacgtcaagaaggtggaaaagcatggcccggggcgctgggtggtgctggcagccgtgctgatcggcctcctcttggtct  360 
30  NGLEEGVE  FLPVNNVKKVEKHGPGRWV  V  L  A.  A  V  L  I  G  L  L  L  V  L  69 

361  tgctggggatcggcttcctggtgtggcatttgcagtaccgggacgtgcgtgtccagaaggtcttcaatggctacatgaggatcacaaatgagaattttgtggatgcctacgagaactcca  480 
70  LGIGFLVW  HLQYRDVRVQKVFNGYMRITNENFVDAYENSN  109 

481  actccactgagtttgtaagcctggccagcaaggtgaaggacgcgctgaagctgctgtacagcggagtcccattcctgggcccctaccacaaggagtcggctgtgacggccttcagcgagg  600 
110  STEFVSLASKVKDALKLLYSGVPFLGPYHKESAVTAFSEG  149 

601  gcagcgtcatcgcctactactggtctgagttcagcatcccgcagcacctggtggaggaggccgagcgcgtcatggccgaggagcgcgtagtcatgctgcccccgcgggcgcgctccctga  720 
150  SVIAYYWSEFS  I  PQHLVEEAERVMAEERVVMLPPRARSLK  189 

721  aqtcctttqtqqtcacctcaqtggtggctttccccacggactccaaaacagtacagaagacccaggacaacagctgcagctttggcctgcacgcccgcggtgtggagctgatgcgcttca  840 
190  SFVVTSVVAFPTDSKTVQKTQDNS  jp . V  B . L  M  R  F  T  229 

841  ccacgcccggcttccctgacagcccctaccccgctcatgcccgctgccagtgggccc^cggggggacgccgactcagtgctgagcctgaccttccgcagctttgaccttgcgtcctgcg  960 
230  . ’D* "  g" ~ p” ” y  p’ *  A  H  A  R  C  Q~  W  A  L  R  G  D  A  'd”" S "  V*  S  L  T  F  R  S  F  D  L  A  S  ,C  D  269 

961  acqaqcqcqqcaqcqacctqqtqacqgtqtacaacaGcttgagccccatggagccccacgccctggtgcagttgtgtggcacctaccctccctcctacaacctgaccttccactcctccc  1080 

270  E  R  G  S  D  fT  V  T  V  Y . N  T  L  S  P'~M  E  P  H  A  '  "  ‘L . V  Q  L  ‘  C  G  T  Y . P"' P  S  Y  '  N '  'L  'T  F  H  S'S  Q  309 

1081  aqaacqtcctgctcatcacactgataaccaacactgagcggcggcatcccggctttgaggccaccttcttccagctgcctaggatgagcagctgtggaggccgcttacgtaaagcccagg  1200 

310  . t  "L  I . T  N  T . E~  R*  '  R'"'  H~’~P  ””E '  A  "T  F  F  Q  L  P  R  M  S  S  jc  G  G  R  h  R  K  A  Q  G  349 

1201  ggacattcaacagcccctactacccaggccattacccacccaacattgactgcacatggaacattgaggtgcccaacaaccagcatgtgaaggtgcgcttcaaattcttttacctgctgg  1320 

350  |"'f~~;kT;¥,'P" 'Y  TT G" "  VT’r  P  P"n  ’~I D“"c"  "T"'iT"N  'T E"‘V  V"'R' F"' ' K”  F  .  "F  Y  L  L  E  389 

1321  agcccggcgtgcctgcgggcacctgccccaaggactacgtggagatcaacggggagaaatactgcggagagaggtcccagttcgtcgtcaccagcaacagcaacaagatcacagttcgct  1440 
390  jp‘G.  V  P  A  G  TCP  K  D  'Y . V  E  I  N . G  '  E  K  Y  C  G  E'  R  S  'Q  F  V  T  '  S  N  S  N . ~K  I  T  . V  R  F  429 

1441  tccactcaqatcaqtcctacaccqacaccqqcttcttagctgaabacctntcctacgactccagtgacccatgcccggggcagttcacgtgccgcacggggcggtgtatccggaaggagc  1560 
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H  S  D  Q  S  Y  T  D  T  G  F  L  A  E  Y  L  S  Y  D  S  S  D  P 

tgcgctgtgatggctgggccgactgcaccgaccacagcgatgagctcaactgcagttgcgacgccggccaccagttcacgtgcaagaacaagttcbgcaagcecctcttctgggtctgcg 


acagtgtgaacgactgcggagacaacagcgacgagcaggggtgcagttgtCCGGCCCAGACCTTCAGGTGTTCCAATGGGAAGTGCcTcTCGAAAAGCCAGCAGTGCAATGGGAAGGACG 


ACTGTGGGGACGGTTCCGACGAGGCCTCCTGCCCCAAGGTGAACGTCGTCACTTGTACCAAACACACCTACCGCTGCCTCAATGGGCTCTGCTTGAGCAAGGGCAACCCTGAGTGTGACG 

§g|g£|»H£BEiEK9QEH^E  k  v  n  v  t 

GGAAGGAGGACTGTAGCGACGGCTCAGATGAGAAGGACTGCGACTGTTGGCTGCGGTCATTCACGAGACAGGCTCGTGTTGTTGGGGGCACGGATGCGGATGAGGGCGAGTGGCCCTGGC 

CWLRSFTRQARVVGGTDADEGEWPWQ 

t 

AGGTAAGCCTGCATGCTCTGGGCCAGGGCCACATCTGCGGTGCTTCCCTCATCTCTCCCAACTGGCTGGTCTCTGCCGCACACTGCTACATCGATGACAGAGGATTCAGGTACTCAGACC 

vslhalgqghicgaslispnwlvsaaHcyiddrgfrysdp 

CCACGCAGTGgacggccttCCTGGGCTTGCACGACCAGAGCCAGCGCAGCGCCCCTGGGGTGCAGGAGCGCAGGCTCAAGCGCATCATCTCCCACCCCTTCTTCAATGACTTCACCTTCG 

TQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFD 

ACTATGACATCGCGCTGCTGGAGCTGGAGAAACCGGCAGAGTACAGCTCCATGGTGCGGCCCATCTGCCTGCCGGACGCCTCCCATGTCTTCCCTGCCGGCAAGGCCATCTGGGTCACGG 

yDiallelekpaeyssmvrpiclpdashvfpagkaiwvtg 

gctggggacacacccagtatggaggcactggcgcgctgatcctgcaaaagggtgagatccgcgtcatcaaccagaccacctgcgagaacctcctgccgcagcagatcacgccgcgcatga 

wghtqyggtgalilqkgeirvinqttcenllpqqitprmm 

tgtgcgtgggcttcctcagcggcggcgtggactcctgccagggtgattccgggggacccctgtccagcgtggaggcggatgggcggatcttccaggccggtgtggtgagctggggagacg 

cvgflsggvdscqgd^ggplssveadgrifqagvvswgdg 

gctgcgctcagaggaacaagccaggcgtgtacacaaggctccctctgtttcgggactggatcaaagagaacactggggtataggggccggggccacccaaatgtgtacacctgcggggcc 

caqrnkpgvytrlplfrdwikentgv 

acccatcgtccaccccagtgtgcacgcctgcaggctggagactggaccgctgactgcaccagcgcccccagaacatacactgtgaactcaatctccagggctccaaatctgcctagaaaa 

cctctcgcttcctcagcctccaaagtggagctgggaggtagaagggttctactgacccaactgggggcaaaggtttgaagacacagcttcccccgccagccccaagctgggccgaggcgc 

gtttgtgtatatctgcctcccctgtctgtaaggagcagcgggaacggagcttcggagcctcctcagtgaaggtggtggggctgccggatctgggctgtggggcccttgggccacgctctt 

gaggaagcccaggctcggaggaccctggaaaacagacgggtctgagactgaaattgttttaccagctcccagggtggacttcagtgtgtgtatttgtgtaaatgagtaaaacattttatt 
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The  deduced  amino  acid  sequence  from  the  open  reading  frame  contains  the  His57,  Asp102  and 
Ser195  motifs  characteristic  of  serine  proteases  (Val652-Cys657,  Asp711-Leu714  and  Asp^-Ser810 
respectively)  and  is  compatible  with  the  SI  family  of  the  SA  clan  of  serine-type  petidases  (Rawlings 
and  Barrett,  1994).  A  potential  serine  protease  activation  sequence  (RWGG)  is  present  at  residues 
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p6i4_Q6i8  suggesting  that  human  epithin  is  synthesized  as  a  single  chain  zymogen.  While  there  is  no 
apparent  signal  peptide  sequence,  Kyle-Doolittle  hydropathy  plots  using  the  GCG  Peptide  Structure 
program  (Fig.  3A)  indicated  the  presence  of  a  highly  hydrophobic  region  between  residues  V57  and  W77 
consistent  with  a  transmembrane  spanning  region.  Similar  results  were  obtained  using  the  TMpred 
(Hofman  &  Stoffel,1993)  and  TopPred  2  programs  (Claras  &  von  Heijne,  1994).  A  ProfileScan  of  the 
ORF  against  the  Prosite  library  showed  two  potential  CUB  domains  (residues  214-334  and  340-447) 
and  four  potential  LDL-receptor  class  A  domains  (residues  452-487,  487-524,  524-560  and  566-603). 
Epithin  has  40  putative  external  cysteine  residues.  Each  LDR  repeat  contains  six  cysteine  residues 
presumably  in  3  internal  disulfide  linkages  (Brown  et  al  1997).  Each  of  the  two  CUB  domains  contains 
four  cysteine  residues  in  disulfide  linkage  which,  also  most  likely  are  in  internal  disulfide  linkages  (Bork 
&  Beckman  1993).  The  catalytic  domain  has  8  cysteine  residues  in  four  disulfide  linkages  analogous  to 
chymotrypsin  (i.e.  Cys604-Cys732,  Cys641-Cys657,  Cys776-Cys790,  Cys801-Cys830).  Cys604-Cys732  serves  to  link 
the  catalytic  and  regulatory  domains.  A  schematic  representation  of  the  human  epithin  domain  structure 
is  shown  in  Fig.  3B. 

Using  BLAST,  an  80.3%  identity  in  843  amino  acid  overlap  was  found  between  the  deduced 
amino  acid  sequence  and  the  deduced  sequence  of  epithin  a  putative  protease  cloned  from  a  mouse 
thymocyte  cDNA  library  (Kim  et  al  1999).  An  almost  perfect  identity  at  the  cDNA  level  was  found 
between  the  MCF-7  cell  protease  and  matriptase,  a  683  amino  acid  serine  protease  recently  cloned 
from  T47-D  breast  cancer  cells  (Lin  et  al  1999).  A  45-55%  identity  at  the  amino  acid  level  was  found 
between  the  human  epithin  and  the  human  serine  proteases  (or  their  zymogens)  enterokinase 
precursor  (Kitamoto  et  al  1995),  hepsin  (Leytus  et  al  1988),  prekalikrein  (Chung  et  al  1986),  TMPRSS2 
protease  (Paolino-Giacobino,  et  al  1997),  prostasin  (Yu  et  al  1995)  and  drosophila  protease  stubble 
(Appel  et  al  1993)  (Fig.  3C).  Significantly,  enterokinase  precursor,  hepsin,  prekalikrein,  and  prostasin 
are  all  cell  membrane-bound  proteases.  Epithin  most  likely  shows  specificity  for  cleavage  of  peptide 
bonds  after  Lys  or  Arg  residues  as  it,  like  trypsin,  hepsin  and  enterokinase  contains  an  Asp  (Asp266)  at 
the  base  of  the  specificity  pocket  (SI  subsite).  Elastase  and  chymotrypsin-like  enzymes  have  cysteine 
and  serine  residues,  respectively,  at  this  site. 


Figure  3.  A.  Hydropathy  plot  of  the  deduced  human  epithin  amino  acid  sequence  by  the  Kyle-Doolittle 
method.  B.  Schematic  representation  of  the  multi-domain  structure  of  human  epithin.  Numbers 
correspond  to  the  deuced  amino  acid  sequence  derived  from  the  full-length  cDNA  shown  in  Fig.  2.  C. 
Comparison  of  the  amino  acid  sequences  of  human  epithin  and  related  proteins. 
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JiDSKTVQ|TQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPS| 

|tdsktvqrtqdnscsfglhargvelmrfttpgfpdspypaharcqwalrgdadsvlsltfrsfdlascdergsdlvtvyntlspmephalvqlcgtypps| 
~^QRTQDNSCSF^LHAflngBvSpFTTPGFP^SPYPAHARCQWgLRGDADSVLSLTFRSFDP.^BrDEgGSDLVTVYpgLSPMEPHA^\/igLCGT^PF| 
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H_Epithin  600 
H_Matriptase  428 
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H_Enterokinase  65 
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2.  Activation  and  Release  of  Human  Epithin  from  MCF-7  Cells 

Western  blotting  shows  that  both  the  parental  MCF-7  subline  and  a  subline  stably  transfected 
with  a  full-length  from  epithin  cDNA  release  the  same  amount  of  an  approximately  84  kDa  fragment  into 
the  media  (Fig.  4A).  The  nature  of  the  fragment  is  unclear  although  it  must  contain  the  catalytic  domain 
as  this  region  was  used  to  prepare  the  antibody.  Whether  the  fragment  is  generated  by  proteolytic 
cleavage  or  results  from  altered  splicing  is  presently  under  investigation,  although  the  later  possibility 
appears  to  be  remote  as  only  a  single  4.2  kb  transcript  is  seen  on  Northern  blots  (Fig.  1).  There  is  a 
prominent  33  kDa  band  in  reduced  samples  in  the  spent  media  from  transfected  cells  which  is  barely 
visible  in  spent  media  from  the  parental  cells  suggesting  that  it  may  be  a  consequence  of 
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autoprocessing.  Undoubtedly,  this  is  the  catalytic  domain  as  membrane  fractions  from  MCF-7  cells 
labeled  with  the  serine  protease  inhibitor  [3H]-diisopropyl  flurophosphate,  showed  a  33  kDa  [3H]-labeled 
band,  which  also  was  recognized  by  the  epithin  antibody  after  western  blotting  (Fig.  5).  The  fact  that 
treatment  with  phorbol  12-myristate  13-acetate  (PMA)  was  without  effect  suggests  that  epithin 
activation  and  release  of  TGFa  may  not  be  directly  connected.  The  observation  that  addition  of  the 
serine  protease  inhibitor  phenylmethylsulfonyl  fluoride  (DIFP)  to  the  spent  media  after  collection  also 
had  no  effect  suggests  that  activation  occurs  rapidly. 
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Figure  4.  Identification  of  Epithin  on  MCF-7  Cell  Membranes  or  Released  into  the  Media.  A. 

Confluent  MCF-7  ML  (-xfct)  cells  and  MCF-7  ML  cells  transfected  with  a  full  length  epithin  cDNA  (+xfct) 
were  washed  2  times  with  serum-free  media  before  incubation  with  serum-free  media  containing  10'8  M 
estradiol.  After  21  hours  PMA  (50  ng/ml)  was  added  to  half  the  flasks  and  the  incubation  was  allowed 
to  continue  for  an  additional  4  hours.  Conditioned  media  were  collected,  clarified  by  centrifugation  and 
ammonium  sulfate  was  added  to  65%  saturation  with  continuous  mixing.  After  overnight  incubation  at 
4°  the  samples  were  centrifuged  (14,000  x  g,  20  minutes),  the  pellets  resuspended  in  10  mM 
phosphate  buffer  (pH  6.8)  and  dialyzed  overnight  against  the  same  buffer.  Membrane  fractions  were 
prepared  by  washing  cells  3  times  with  PBS,  scraping  in  20mM  Tris-HCI,  pH  7.4  and  homogenizing  with 
a  Dounce  homogenizer.  After  centrifugation  at  600  x  g  for  10  minutes  the  supernatants  were 
centrifuged  at  20,000  x  g  for  20  minutes.  The  resulting  pellets  were  suspended  in  20  mM  Tris-HCI,  pH 
7.4. 1%  Triton-X-100.  Aliquots  of  precipitated  media  and  membrane  fractions  were  electrophoresed  on 
a  12  %  SDS-polyacrylamide  gel  under  reducing  conditions,  transferred  to  a  PVDF  membrane  and 
probed  with  the  antibody  to  PCR-7.  Some  aliquots  of  media  from  transfected  cells  were  subjected  to 
treatment  with  0.5  mM  PMSF  before  electrophoresis  as  indicated.  B.  Transfected  MCF-7  ML  cells 
were  washed  2  times  with  serum-free  media  then  incubated  for  24  hours  with  serum-free  media  ±  PIC 
(protease  inhibitor  cocktail:  Complete™, Mini  EDTA-free  tablets,  Boehringer  Mannheim,  plus  BB-3103, 
British  Biotech).  Media  were  collected  and  treated  as  above  except  that  ammonium  sulfate  precipitates 
from  media  containing  protease  inhibitor  were  suspended  in  and  dialyzed  against  1 0  mM  phosphate 
buffer  containing  10  mM  EDTA.  Membrane  fractions  from  cells  not  treated  with  protease  inhibitor  were 
prepared  ±  PIC  in  wash  and  suspension  buffer,  while  cells  incubated  with  protease  inhibitors  in  the 
media  were  treated  with  protease  inhibitors  throughout  the  purification  procedure.  Aliquots  of  media 
and  membrane  fractions  were  subjected  to  western  blot  analysis  under  both  reducing  and  non-reducing 
conditions. 
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Western  blotting  of  spent  media  from  MCF-7  cells  recognized  an  approximately  45  kDa 
polypeptide  without,  and  33  kDa  with  reduction  (Fig.  5B).  The  45  kDa  fragment  most  likely  contains  the 
activated  catalytic  chain  and  an  approximately  12  kDa  fragment  from  the  regulatory  chain.  The  nature 
of  the  prominent  45  kDa  band  seen  in  all  of  the  membrane  fractions,  and  whether  it  is  identical  to  the 
slightly  larger  poly  peptide  seen  in  the  spent  media  is  presently  under  investigation.  Significantly,  in 
spent  media  from  cells  incubated  with  the  metalloprotease  inhibitor  BB-3103,  only  a  84  kDa  band  is 
evident  suggesting  a  metalloprotease  may  needed  for  initial  epithin  activation. 
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Figure  5.  Labeling  of  MCF-7  cell  membrane 
proteases  with  3H  DIFP.  Confluent  MCF-7  ML  (- 
xfct)  cells  and  MCF-7  ML  cells  transfected  with  a 
full  length  human  epithin  cDNA  (+xfct)  in  T-25 
flasks,  were  washed  3  times  with  serum-free 
media  and  then  incubated  for  30  min  with  media 
containing  10‘8  M  estradiol  and  50  ng/ml  PMA. 
After  30  minutes,  3H-DIFP  (12.5  pCi/ml)  was 
added  to  each  flask  and  the  incubation  was 
continued  for  an  additional  60  minutes.  Media 
was  then  removed  and  the  cells  washed  5  times 
with  PBS  and  lysed  with  buffer  A  (0.5%  Triton-X- 
100,  150  mM  NaCI,  5  mM  EDTA,  50  mM  Tris-HCI, 
pH  7.4  and  0.02%  NaN3).  A.  Aliquots  of  the  lysed 
cells  were  electrophoresed  on  a  12% 
polyacrylamide  gel  under  reducing  conditions  and 
transferred  to  a  PVDF  membrane.  The  membrane 


was  dried  and  exposed  to  Kodak  Biomax  MS  film  under  a  LE  intensifying  screen  for  6  days  at  -70°.  B. 
The  membrane  was  then  rehydrated  with  methanol  and  subjected  to  Western  blot  analysis  using  the 
antibody  to  PCR-7.  Proteins  were  detected  by  ECL.blot  analysis  using  the  antibody  to  PCR-7. 
Proteins  were  detected  by  ECL. 


Gelatin  zymography  was  used  to  confirm  that  human  epithin,  like  matriptase  has  activity  against 
matrix  proteins  (Fig.  6).  In  the  presence  of  Ca++,  required  for  the  action  of  metalloproteases,  two  bands 
are  apparent  (Fig.  6  left  panel)  in  both  the  parental  and  epithin-transfected  cell  lines.  The  higher,  92 
kDa  band  is  present  at  the  level  in  both  cell  lines  The  lower,  84  kDa  band,  is  considerably  more 
pronounced  in  the  transfected  cells.  When  the  zymograms  were  incubated  in  the  presence  of  EDTA,  an 
inhibitor  of  matrix  metalloproteases,  the  upper  band,  most  likely  the  MMP9  gelatinase,  disappeared. 
The  would  suggest  that  the  lower  band  is  epithin  or  some  other  non-metalloprotease.  That  this  is  the 
case  is  shown  in  the  right  panel  where  the  84  kDa  band  was  made  to  disappear  by  incubation  with 
DIFP  whether  in  the  presence  or  absence  of  EDTA.  Consistent  with  the  western  blotting  experiments, 
PMA  had  no  effect.  The  observation  that  the  84  kDa  activity  is  inhibited  by  DIPF  but  not  by  EDTA  and 
that  it  is  elevated  in  transfected  cells  strongly  suggests  that  the  84  kDa  activity  is  indeed  epithin  and 
that  epithin  may  be  directly  involved  in  invasion. 
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Figure  6.  Effect  of  protease  inhibitors  on  epithin  gelatinase  activity.  Left  panel:  Confluent  MCF-7 
ML  (-xfct)  cells  and  MCF-7  ML  cells  transfected  with  a  full  length  epithin  cDNA  (+xfct)  were  washed  2 
times  with  serum-free  media  before  incubation  with  serum-free  media  containing  10'8  M  estradiol.  After 
21  hours  PMA  (50  ng/ml)  was  added  to  half  the  flasks  and  the  incubation  was  allowed  to  continue  for 
an  additional  4  hours.  Conditioned  media  were  collected,  clarified  by  centrifugation  and  ammonium 
sulfate  was  added  to  65%  saturation  with  continuous  mixing.  After  overnight  incubation  at  4°  the 
samples  were  centrifuged  (14,000  x  g,  20  minutes),  the  pellets  resuspended  in  10  mM  phosphate 
buffer  (pH  6.8)  and  dialyzed  overnight  against  the  same  buffer.  Aliquots  were  electrophoresed  on  10% 
zymogram  gels  containing  gelatin.  Gels  were  incubated  overnight,  washed  with  several  changes  of 
2.5%  Triton-X-100  and  incubated  at  37°  overnight  in  50mM  Tris-HCI,  pH  7.5,  200  mM  Nacl,  0.02%  Brij- 
35  and  either  5mM  CaCI2or  10  mM  EDTA.  Gels  were  then  stained  with  Coomassie  Blue  R-250.  Right 
panel:  Ammonium  sulfate  precipitated  conditioned  media  from  transfected  cells  were  incubated  5  min 
minus  or  plus  10  mM  DIFP  before  electrophoresis  on  10%  zymogram  gels  containing  gelatin.  Gels 
were  treated  as  in  the  left  panel. 


3.  Comparative  Expression  of  Epithin,  ProTGFa  and  Antitrypsin  by  Various  Breast  Cancer 

Cell  Lines.  Three  breast  cancer  cell  lines  in  addition  to  MCF-7  cells  were  examined  for  the  expression 
of  epithin,  a  potential  proTGFa  cleaving  enzyme,  proTGFa  and  ^-antitrypsin  (a,-AT),  a  potential 
regulator  of  TGFa  cleaving  enzyme  activity.  Preliminary  data  from  this  laboratory  suggested  that  TGFa 
from  MCF-7  cells  could  be  blocked  by  o^-AT  and  arAT  could  complex  with  a  cell  surface  protease  on 

MCF-7  cells.  If  our  hypothesis  is  correct,  then  all  four 
cell  lines  should  express  the  three  proteins  and  TGFa 
release  and  tumorigenicity  should  be  related  to  the  ratio 
of  the  expression  of  epithin:  a^AT.  To  test  this 
hypothesis  we  have  first  compared  steady-state  mRNA 
levels  of  epithin  (shown  as  PCR-7),  arAT  and  proTGFa 
in  the  four  breast  cancer  cell  lines. 
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Figure  7  Steady  state  mRNA  levels  of  ai-AT,  epithin 
(PCR-7),  and  proTGFa  in  breast  cancer  cell  lines. 

Total  RNA  was  isolated  and  subjected  to  Northern  blot 
analysis  with  the  indicated  32P-labeled  cDNA  probes. 
Blots  were  then  cleared  and  reprobed  with  a  p-actin 
cDNA.  Autoradiograms  were  quantified  by  densitometry. 
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CONCLUSSIONS 


•  Degenerate  oligonucleotides  based  on  the  conserved  sequences  about  the  his57  and  ser195  residues 
in  mammalian  serine  proteases  were  used  to  clone  a  potential  TGFa  cleaving  enzyme  from  MCF-7 
breast  cancer  cells.  The  3270  bp  cDNA  has  a  deduced  855  amino  acid  sequence  consistent  with 
that  of  a  multi-domain,  type  II  membrane  protease  and  shows  considerable  homology  to  epithin,  a 
recently  described  protein  from  mouse  thymocytes.  Mouse  and  human  epithin  have  similar 
membrane  spanning  and  trypsin-like  serine  protease  domains.  Each  also  has  two  putative  CUB  and 
four  low-density  lipoprotein  receptor  domains,  which  may  be  important  for  regulation  and/or 
substrate  binding.  Matriptase,  a  recently  described  cDNA  from  T47-D  breast  cancer  cells  may  be  a 
partial  epithin  clone. 

•  Northern  blotting  showed  highest  expression  of  epithin  in  human  epithelial  cells  particularly  breast, 
kidney,  trophoblast,  and  uterine  glandular  epithelium.  Southern  analysis  indicated  the  presence  of 
sequences  homologous  to  human  epithin  in  baboon,  mouse,  and  rabbit  but  not  in  chicken  or 
drosophila  DNA. 

•  A  polyclonal  antibody  to  the  epithin  active  site  domain  inhibited  phorbol  ester-induced  TGFa 
release  from  MCF-7  cells  by  >50%  suggesting  that  epithin  is  involved  in  proTGFa  processing. 

•  Western  blotting  of  spent  media  from  MCF-7  cells  recognized  an  approximately  42  kDa  polypeptide 
without,  and  33  kDa  with  reduction.  However,  in  media  from  cells  incubated  with  the 
metalloprotease  inhibitor  BB3103,  only  a  84  kDa  band  was  evident  suggesting  a  metalloprotease 
may  needed  for  epithin  activation.  Membrane  fractions  from  MCF-7  cells  labeled  with  the  serine 
protease  inhibitor  [3H]-diisopropyl  flurophosphate,  showed  a  33  kDa  [3H]-labeled  band,  which  also 
was  recognized  by  the  epithin  antibody  after  western  blotting.  On  gelatin  zymography,  two 
prominent  bands  were  apparent  in  the  presence  of  Ca+\  the  higher  MW  form  disappeared  in  the 
presence  of  EDTA,  an  inhibitor  of  matrix  metalloproteases,  the  lower  in  the  presence  of  inhibitors  of 
serine  proteases.  These  results  suggest  that  epithin  may  be  directly  involved  in  invasion. 

•  Immunohistochemistry  of  paraffin  sections  from  normal  human  breast  tissue  showed  that  the 
epithin  protein  was  restricted  to  the  surface  of  ductal  epithelial  cells  where  it  co-localized  with  the 
epidermal  growth  factor  receptor.  Epithin  expression  appeared  to  be  elevated  in  DCIS  and  in 
invasive  breast  carcinoma. 
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